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Abstract 

Background: Direct-acting antiviral (DAAs) agents for hepatitis C virus (HCV) span a variety of targets, including 
proteins encoded by the NS3/4A, NS4B, NS5A, and NS5B genes. Treatment with DAAs has been shown to select 
variants with sequence changes in the HCV genome encoding amino acids that may confer resistance to the 
treatment. In order to assess these effects in patients, a Reverse Transcription Polymerase Chain Reaction (RT-PCR) 
method was developed to sequence these regions of HCV from patient plasma. 

Methods: A method was developed to amplify and sequence genotype 1 HCV RNA from patient plasma. 
Optimization of HCV RNA isolation, cDNA synthesis, and nested PCR steps were performed. The optimization of 
HCV RNA isolation, design of RT-PCR primers, optimization of RT-PCR amplification conditions and reagents, and the 
evaluation of the RT-PCR method performance is described. 

Results: The optimized method is able to successfully, accurately, and reproducibly amplify near full-length 
genotype 1 HCV RNA containing a wide range of concentrations (10^ to 10^ lU/mL) with a success rate of 97%. The 
lower limit of detection was determined to be 1000 lU/mL HCV RNA. 

Conclusions: This assay allows viral sequencing of all regions targeted by the most common DAAs currently in 
development, as well as the possibility to determine linkage between variants conferring resistance to multiple 
DAAs used in combination therapy. 

Keywords: Hepatitis C Virus (HCV), Reverse Transcription Polymerase Chain Reaction (RT-PCR), Genotype, Direct 
Acting Antiviral Agents (DAA), Lower Limit of Detection (LLOD), Quasispecies, Sequencing, Resistance, NS 
(Non-stmctural), NS3/4A, NS4B, NS5A, NS5B 
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Background 

More than 170 million people (3% of the world popu- 
lation) are infected with hepatitis C virus (HCV) [1]. 
Chronic HCV infection is a major cause of liver 
cirrhosis and hepatocellular carcinoma (HCC), as well 
as liver failure. Liver failure is one of the most common 
indications for liver transplantation in the United States 
[2,3]- The recent development of potent, direct- acting 
antiviral agents (DAAs), used in combination with 
pegylated-interferon (Peg-IFN) and ribavirin (RBV) has 
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significantly improved the sustained virologic response 
rates compared with Peg-IFN and RBV alone [4,5]. 

HCV has high genetic variability across the six major 
genotypes as well as in different subtypes within each 
genotype. Genotype 1, the predominant cause of chronic 
HCV infections in the United States, Europe, and Japan, 
has more diversity within subtypes la and lb than the 
entire diversity of HBV and HIV [6]. In an HCV- infected 
patient, this pre-existing genetic variability provides a 
pool of variants that can adapt to new selection pres- 
sures, such as antiviral treatment. The variants are a nat- 
ural part of the viral quasispecies, which arise from the 
error-prone nature of the HCV RNA-dependent RNA 
polymerase and the high replication rate of the virus [7]. 
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In patients who fail treatment with a potent DAA regi- 
men, the elimination of the majority of the DAA- 
sensitive quasispecies can lead to selection and detection 
of DAA-resistant variants [8] . 

The nonstructural (NS) HCV proteins NS3, NS4A, 
NS4B, NS5A, and NS5B are essential for viral replication 
and are the major targets of DAAs currently in develop- 
ment [9]. Thus, monitoring viral sequence changes 
across multiple NS regions is beneficial in understanding 
the potential association between therapeutic resistance 
and clinical outcome [10]. 

Various methods for amplification and sequence analysis 
of specific HCV genomes have been reported, and 
most of them have focused on individual NS segments 
or smaller amplicons [11-14]. In 2006, a long reverse 
transcription polymerase chain reaction (RT-PCR) ampli- 
fication protocol of a near full-length HCV genome was 
developed by Fan, X, Xu, Y, and Di Bisceglie, AM, 
that amplified a limited number of patient samples 
[15-17]. WhUe conducting telaprevir (INCIVEK™, 
INCIVO®) cUnical trials [18,19], the need arose to de- 
velop a method to analyze the HCV genotype 1 se- 
quence of NS3, NS4A,NS5A, and NS5B from a single 
genome or the catalytic domain of the NS3»4A protease 
domain from a large number of patients. The challenge 
was that samples from patients are extremely variable 
and existing methods described in the literature required 
patient-by-patient optimization to efficiently amplify 
near full-length HCV RNA, particularly in samples with 
low HCV RNA levels, which were common in patients 
who were responding to telaprevir and/or VX-222. In 
this study, we describe the development of an efficient 
RT-PCR method that was used in telaprevir clinical trials 
to amplify a nearly full-length HCV genotype 1 genome 
from 7800 unique patient samples containing a wide 
range of HCV RNA concentrations (10^ to 10'*IU/mL) 
with a success rate of 97% [4,5,20-23]. This method 
enables HCV genes, targeted by the most common 
DAAs in clinical trials today, to be efficiently sequenced 
using clinical samples with low HCV RNA levels and 
permits the assessment of linkage between multiple 
amino acid changes in different HCV genes in samples 
from patients receiving DAA drug combinations. 

Results and discussion 

Optimization of HCV RNA extraction 

The use of the 96-well format QIAamp Virus BioRobot 
9604 Kit (QIAGEN, Valencia, CA) was optimized to pro- 
duce high-quality viral RNA from HCV RNA-positive 
plasma. Several modifications were made, with the primary 
change involving the replacement of the carrier RNA in the 
RNA isolation. Comparison of the use of poly A RNA (rA) 
provided in the QIAGEN kit with transfer RNA (tRNA) 
(Roche Applied Science, Indianapolis, IN) or ribosomal 



RNA (rRNA) (Roche Applied Science, Indianapolis, IN) 
using plasma samples with a wide range of HCV RNA copy 
numbers (10 ~ 18,500 HCV RNA copies/reaction) showed 
that 60 |ig of tRNA per column for 660 i^L plasma led to 
the highest level of sensitivity in the subsequent amplifica- 
tion reaction. In addition, several other minor changes to 
the standard QIAamp protocol also increased viral RNA 
purity and quality, which helped the performance of subse- 
quent amplifications including mixing via gentie rocking, in- 
stead of pipetting (per the BioRobot 9604 Kit protocol), to 
minimize shearing of the 9.6 kb HCV RNA. Also, for opti- 
mal recovery of viral RNA, the samples were lysed at 56°C 
and an additional spin step was added following the final 
wash step to eliminate residual ethanol carry-over. 

cDNA synthesis 

Synthesis of cDNA is a critical step in the ability to amp- 
lify large PCR products, such as the 9.6 kb HCV gen- 
ome. Optimization of the cDNA synthesis from HCV 
RNA involved testing different oligonucleotide primers 
including random hexamers, oligo d(A) (20 nucleotide), 
and primers that annealed in the NS5B or 3' untrans- 
lated region (UTR). Surprisingly, most of the primers 
binding in the NS5B or 3'UTR did not allow amplifica- 
tion of full-length viral RNA or lead to very low cDNA 
yields. In contrast, oligo dA20, which anneals to the 
highly conserved poly (U) tail at the 3' end of the HCV 
RNA, did allow for high sensitivity and reproducible 
amplification of full-length cDNA. We also tested several 
conditions and concentrations of Reverse Transcriptase 
enzyme (Superscript III Reverse Transcriptase [Life 
Technologies Corporation, Carlsbad, CA]) and it was 
determined that the use of 400 units with a 2.5 hour 
protocol, containing increasing annealing/elongation tem- 
peratures, was preferred. 

Amplification of HCV via nested PCR 

To amplify sufficient amounts of the HCV genome for 
subsequent sequence analysis, PCR primers and condi- 
tions were optimized. Using several hundred publicly 
available genotype 1 HCV sequences (Los Alamos HCV 
Sequence Database [24]), the regions of highest conser- 
vation in the 5' and 3'UTRs were selected for potential 
primer annealing sites. Subsequent evaluation of primers 
to the conserved regions showed a diversity of responses 
in the assay. Many primers resulted in either a poor 
PCR yield, potentially due to the secondary structure of 
HCV RNA in the binding region, or a poorer than 
expected effectiveness across a spectrum of multiple 
samples. The highest performing primer annealing sites 
were at base pair (bp) 242-271 in the 5'UTR and bp 
9278-9305 in the NS5B coding region. When various 
primer lengths were investigated, the use of 28-30 bp 
oligonucleotide primers led to the highest sensitivity and 
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Table 1 List of primers used for the amplification of the 
HCV coding region 


RT primer 


Oligo d(A) 


5'-AAAAAAAAAAAAAAAA 
AAAA-3' 


PCR1 primers Subtype la 


GEN1.F1: 


5'-CAAGACTGGTAGCGGAG 
TAGTGTTGGGTCG-3' 




GEN1A.R1 


5'-CCGGGCAYGAGACAGGCT 
GTGATAAATG-3' 


Subtype 1 b 


GEN1.F1 


5'-CAAGACTGCTAGCGGAGTA 
GTGTTGGGTCG-3' 




GEN1B.R1 


5'-TCGGGCAGGAGACAVGCTG 
TGATATATG-3' 


PCR2 primers Subtype la 


GEN1.F2 


5'-GTAGTGCCTGATAGGGTGGT 
TGCGAGTGCC-3' 




GEN1A.R2 


5'-TCTCCCCCGCTGTAGCCAGC 
GGTGAACC-3' 


Subtype 1 b 


GEN1.F2 


5'-GTACTGCCTGATAGGGTGCT 
TGCGAGTGGC-3' 




GEN1B.R2 


5'-TCTGCCCCGaGTARCCAGC 
RACGAACG-3' 



amplification across a spectrum of varied samples, by 
potentially annealing to viral genomes with several nu- 
cleotide mismatches from the primer. Also, to increase 
the ability to amplify across a spectrum of samples from 
a large number of patients, some degeneracy was added 
in the least conserved nucleotides in each annealing 
region. Differences in the primer annealing regions be- 
tween subtypes la and lb led to the design of subtype- 
specific primers (Table 1). Conditions with the largest 
impact on assay performance included the use of a 
touchdown PCR method (ramping annealing tempera- 
tures) for increased specificity, and addition of a second 
nested PCR reaction to increase the sensitivity of the 
assay to -1000 lU/mL of HCV RNA (Figure 1). The com- 
bination of a highly processive polymerase (KlenTaql ) 
with a proofreading (pfii) polymerase, and addition of 
1.5 M betaine increased the yield of the -9000 bp PCR 
product. Betaine is likely acting through the reduction of 



the bp composition dependence on DNA strand melting 
[25,26]. 

Sensitivity of RT-PCR amplification 

To assess the sensitivity of the method, four genotype 1 
HCV RNA-positive plasma samples (two subtypes la and 
two subtypes lb) were serially diluted with concentrations 
of HCV RNA ranging from 11-649,000 lU/mL, and two 
analysts performed the amplification on different days. 
The HCV RNA-positive plasma samples were processed 
through the RT-PCR step and subsequent PCR amplifica- 
tions were performed in triplicate. Samples with HCV 
RNA greater than or equal to 1000 lU/mL had an amplifi- 
cation success rate of 70% (100 out of 143 samples) in the 
first attempt. Samples with HCV RNA below 1000 lU/mL 
had a much lower amplification success rate of 1.6% (2 out 
of 121 samples). Therefore, the lower limit of detection 
(LLOD) of the RT-PCR amplification method was estab- 
lished to be -1000 lU/mL. No significant differences in 
success rates were observed either between subtype la and 
lb samples or between analysts. 

Robustness and success rate of amplification and 
sequencing 

The performance of the assay was subsequently evaluated 
on a large, diverse set of plasma samples containing HCV 
subtypes la or lb from telaprevir and VX-222 clinical trials, 
with HCV RNA titers ranging from 100 lU/mL to greater 
than 10 lU/mL (Table 2). We processed 7800 unique pa- 
tient plasma samples containing subtype la (5400) and 
subtype lb (2400) with HCV RNA at or above the assay 
limit of detection of 1000 lU/mL from August 2006 to De- 
cember 2010. The genetic diversity of both subtypes la 
and lb sample sets was similar to sample sets in the public 
domain (Los Alamos HCV Sequence Database [24]) as 
shown in Table 3 (Shannon Entropy [27], BioEdit version 
7.0.5.3). The majority (-97%) of subtype la and subtype 
lb samples were successfully amplified, and usually on the 
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Figure 1 Primer locations. RT, PCR1 and PGR2 primers were designed based on H77 for subtype la and Coni for subtype lb, and the relative 
position of tliese primers is sliown. 



Zhang ef al. Virology Journal 2013, 10:53 
http://www.virologyj.eom/content/10/1/53 



Page 4 of 6 



Table 2 Summary of RT-PCR amplification and 
sequencing success rate of clinical samples from 
telaprevir and VX-222 clinical trials 

HCV RNA Genotype/ Numbers Success rate Success rate 
(lU/mL)^ Subtype*" of clinical of RT-PCR of sequencing 
samples amplification % (n/N) 
% (n/N) 



> 1000 la 


5400 


97 (5238/5400) ' 


39.6 (5218/5238) 


lb 


2400 


97 (2328/2400) ' 


39.6 (2318/2328) 


100-1000 la 


152 


48(73/152) ' 


38.6 (73/73) 


lb 


107 


40(43/107) 


100.0 (43/43) 



"■HCV RNA was assessed using the Roche TaqMan HCV RNA assay (Version 2.0). 
^Genotype and subtype was defined using 5'NC assay (Trugene) or Versant 
HCV genotype assay (LiPA) 2.0. 



first attempt. For the remaining 3% of samples that failed 
to amplify, no correlation with HCV RNA level was 
observed. The failure of the assay was likely due to high 
sequence diversity in one or more of the primer binding 
regions, caused by either the natural high variability in 
genotype 1 samples, or mis-genotyping of the samples. 
Testing of samples below the 1000 lU/mL assay limit of 
detection, including samples as low as 100 lU/mL, led to a 
lower success rate (48% and 40% for subtypes la and lb, 
respectively), and were not pursued further. Among the 
amplified samples, nearly 99% of the samples were suc- 
cessfully sequenced with a small set of standardized se- 
quencing oligonucleotides, and the sequence information 
was consistent across replicates. The remaining 1% of 
samples for which sequence was not available had the 
highest sequence diversity in the areas of interest, leading 
to poor annealing for the sequencing oligonucleotides. 

Conclusion 

Despite the high sequence diversity of HCV, successful 
amplification of a near full-length (~9 kb) HCV genotype 
1 genome was completed on 97% (7566/7800) of patient 
plasma samples with HCV RNA concentrations greater 
than or equal to 1000 lU/mL. This method facilitated 
the examination of sequence changes in the viral popu- 
lation during DAA treatment and aided the development 
of telaprevir [28]. The ability to amplify and sequence 



the near full length HCV genome and to look at muta- 
tions across multiple DAA targets will be important in 
the development of multiple DAA-based regimens. 

Methods 

Studies included in these analyses were conducted in full 
compliance with the guidelines of Good Clinical Practice 
and of the World Medical Assembly Declaration of Hel- 
sinki. Prior to study initiation, protocols and informed 
consent forms were reviewed and approved by institu- 
tional review boards at each study site. All patients pro- 
vided written informed consent before participating in 
any study-related activity. ClinicalTrials Identifiers for 
studies included in the manuscript are as follows: 
NCT00336479, NCT00372385, NCT00420784, NCTOO- 
535847, NCT00627926, NCT00983853, NCT00758043, 
NCT00916474, NCT00528528, NCT00703118, NCTOO- 
911963, NCT01080222. 

This method is based on the amplification protocol 
originally described in Kwong et al. [29] . 

HCV RNA extraction 

Viral RNA was extracted from plasma in a 96-well format 
according to the manufacturer's instruction with several 
modifications. Plasma (220 ~ 660 |iL, the volume was 
adjusted based on viral load, i.e., 220 |iL for HCV RNA 
greater than or equal to 50,000 lU/mL, while 660 i^L for 
HCV RNA less than 50,000 lU/mL) was mbced with QIA- 
GEN Protease K (40 ~ 120 ^L) and QIAamp AL buffer 
(240 ~ 720 i^L) supplemented with 20 ~ 60 |ig of carrier 
RNA (tRNA) per column. The mixture was incubated for 
15 min at 56°C, and absolute ethanol (293 ~ 875 \iL) was 
added to each well and mixed well. The mixture was 
loaded into QIAamp column and passed through the col- 
umn by suction with a peristaltic micropump (IPS -16; 
Ismatec, Zurich, Switzerland). Subsequently, the column 
was washed with AWl buffer (1000 ^L) and AW2 buffer 
(1000 |iL) (provided in QIAGEN kit), respectively, by ap- 
plying vacuum. A second wash with AW2 buffer (1000 
I^L) was spun at 6000 x g for 10 min and an additional 
spin at 6000 x g for 15 min was applied to remove residual 
ethanol. 40 |iL of RNA storage solution (Ambion) was 



Table 3 Comparison of HCV NS3(4A) region of Los Alamos and vertex sequences 



Subtype Sequence domain Number of sequences" Length of alignment Mean Shannon Average pairwise difference 

(base pair) entropy (Standard deviation) 



la 


Los Alamos Sequences 


695 


2067 


0.132014 


7.5% (± 1 .9%) 




Vertex Sequences 


2111* 


2085 


0.175243 


9.1% (± 4.6%) 


lb 


Los Alamos Sequences 


561 


2057 


0.158275 


9.3% (± 1 .5%) 




Vertex Sequences 


1335t 


2085 


0.202466 


10.6% (± 3.4%) 



° Sequences of HCV NS3(4A) region. 

* GenBanl< accession numbers: KCl 25008-KCl 271 1 8. 

t GenBanl< accession numbers: KCl 23434-KCl 24768. 
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loaded into each column and incubated at room 
temperature for 5 min. The RNA was eluted by centrifu- 
gation at 6000 x g for 10 min. The elution was repeated 
once to increase the yield. A total of 80 \iL viral RNA was 
isolated from 220 to 660 i^L of plasma. 

cDNA synthesis 

A complementary DNA (cDNA) fragment was synthe- 
sized from HCV RNA that was diluted (1:2 or 1:4, the 
dilution factor was based on viral load, i.e., 1:2 and 1:4 
for HCV RNA less than 50,000 lU/mL, while 1:4 for 
HCV RNA greater than or equal to 50,000 lU/mL) into 
a total RT reaction volume of 20 i^L composed of 2.5 
|iM of an oligo-dA20 primer (Table 1), 400 units of 
Superscript III Reverse Transcriptase (Life Technologies 
Corporation, Carlsbad, CA), 40 units of RNAseOUT (Life 
Technologies Corporation, Carlsbad, CA), PC2 reaction 
buffer (50 mM Tris-HCl pH9.1, 16 mM ammonium 
sulfate, 3.5 mM magnesium chloride, and 150 i^g/mL BSA) 
(AB Peptides, St. Louis, MO), 500 ^M dNTPs (Clontech, 
Mountain View, CA), and 5 mM DTT (Life Technologies 
Corporation, Carlsbad, CA). The RNA was denatured at 
65°C for 5 min, followed by ramping extension tempera- 
tures at 25°C for 10 min, 42°C for 60 min, 50°C for 30 min, 
55°C for 30 min, and 70°C for 15 min. 

Amplification of HCV via nested PCR 

The cDNA from the RT reaction was diluted 1:1 into a 
40 i^L PCRl reaction mixture, containing PC2 reaction 
buffer (AB Peptides, St. Louis, MO), 200 ^M dNTPs 
(Clontech, Mountain View, CA), 1.5 M betaine (Sigma 
Aldrich, St. Louis, MO), 2.56 units Klentaq DNA poly- 
merase (AB Peptides, St. Louis, MO), 1.28 units pfu 
DNA polymerase (Stratagene, La JoUa, CA), and 400 ^M 
each subtype-specific primer (Table 1). The PCRl reac- 
tion was incubated at 94°C for 2 min, followed by 
30 cycles at 94°C for 15 sec, 68°C -0.5°C/cycle ("touch- 
down" PCR) for 20 sec, followed by an incubation at 
68°C for 12 min. The completed PCRl reaction was 
diluted 1:10 into a second PCR mixture (PCR2, 50 ^iL). 
The reaction composition and PCR cycling parameters 
were identical to the PCRl reaction, with the following 
changes; 3.2 units Klentaq DNA polymerase, 1.6 units of 
Pfu DNA polymerase, and nested subtype-specific pri- 
mers were utilized (Table 1). 

Preparation of sample for sequencing 

The PCR2 product (8991 bp) was analyzed and verified by 
agarose gel electrophoresis, and purified using the QIA- 
quick 96 PCR Purification kit (QIAGEN, Valencia, CA). 
The purified PCR2 product was subsequently quantified 
using a NanoDrop 8000 Spectrophotometer (Thermo 
Scientific, Hudson, NH) prior to sequence analysis. 
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